Fast System Matrix Generation on a GPU Cluster

Authors

  • Balázs Tóth
  • Milán Magdics
  • László Szirmay-Kalos
Abstract

This paper presents an algorithm for Positron Emission Tomography reconstruction running on a GPU cluster. The most computation-intensive part of the reconstruction process, the forward projection, is re-interpreted as a geometric problem that can be solved efficiently by the graphics hardware. We also investigate how to further increase the speed and sidestep the texture memory limitations by using a cluster of GPUs instead of a single GPU. To do so, the iteration scheme is modified to minimize the communication needed between the GPU nodes.

Similar resources

Finite Element Matrix Generation on a GPU

This paper presents an efficient technique for fast generation of sparse systems of linear equations arising in computational electromagnetics in a finite element method using higher order elements. The proposed approach employs a graphics processing unit (GPU) for both numerical integration and matrix assembly. The performance results obtained on a test platform consisting of a Fermi GPU (1x T...

An Approach in Radiation Therapy Treatment Planning: A Fast, GPU-Based Monte Carlo Method

Introduction: An accurate and fast radiation dose calculation is essential for successful radiotherapy. The aim of this study was to implement a new graphics processing unit (GPU) based radiation therapy treatment planning for accurate and fast dose calculation in radiotherapy centers. Materials and Methods: A program was written for parallel runnin...

Ultra-Fast Image Reconstruction of Tomosynthesis Mammography Using GPU

Digital Breast Tomosynthesis (DBT) is a technology that creates three-dimensional (3D) images of breast tissue. Tomosynthesis mammography detects lesions that are not detectable with other imaging systems. If image reconstruction time is on the order of seconds, tomosynthesis systems can be used to perform tomosynthesis-guided interventional procedures. This research has been designed to study u...

Implementation of LU, QR and RNG in Flagon

This paper introduces fast LU and QR implementations on GPU which are extended from LAPACK routines. Using fast matrix-matrix multiplication algorithm on GPU, right-looking technique to parallelize the computation, look-ahead technique to override the CPU and GPU computation together with optimal block size on GPU make this implementation outperform its counterparts. It gains around 2~8x speedu...

Fast Cellular Automata Implementation on Graphic Processor Unit (GPU) for Salt and Pepper Noise Removal

Noise removal is commonly applied as a pre-processing step before subsequent image processing tasks, because noise occurs during the acquisition or transmission process. A common problem in imaging systems that use CMOS or CCD sensors is the appearance of salt and pepper noise. This paper presents a Cellular Automata (CA) framework for noise removal of distorted image by the salt an...

Publication date: 2009